Optimized M2L Kernels for the Chebyshev Interpolation based Fast Multipole Method
نویسندگان
چکیده
A fast multipole method (FMM) for asymptotically smooth kernel functions (1/r, 1/r4, Gauss and Stokes kernels, radial basis functions, etc.) based on a Chebyshev interpolation scheme has been introduced in [5]. The method has been extended to oscillatory kernels (e.g., Helmholtz kernel) in [12]. Beside its generality this FMM turns out to be favorable due to its easy implementation and its high performance based on intensive use of highly optimized BLAS libraries. However, one of its bottlenecks is the precomputation of the multiple-to-local (M2L) operator, and its higher number of floating point operations (flops) compared to other FMM formulations. Here, we present several optimizations for that operator, which is known to be the costliest FMM operator. The most efficient ones do not only reduce the precomputation time by a factor up to 340 but they also speed up the matrix-vector product. We conclude with comparisons and numerical validations of all presented optimizations.
منابع مشابه
Optimizing the multipole-to-local operator in the fast multipole method for graphical processing units
This paper presents a number of algorithms to run the fast multipole method (FMM) on NVIDIA CUDAcapable graphical processing units (GPUs) (Nvidia Corporation, Sta. Clara, CA, USA). The FMM is a class of methods to compute pairwise interactions between N particles for a given error tolerance and with computational cost of O.N /. The methods described in the paper are applicable to any FMMs in wh...
متن کاملAn Implementation of Low-Frequency Fast Multipole BIEM for Helmholtz’ Equation on GPU
Acceleration of the fast multipole method (FMM), which is the fast and approximate algorithm to compute the pairwise interactions among many bodies, with graphics processing units (GPUs) has been investigated for the last couple of years. In view of the type of kernel functions, the non-oscillatory kernels (especially, the Laplace kernel) were studied by many researchers (e.g. Gumerov), and the...
متن کاملAn Empirical Interpolation based Fast Summation Method for translation invariant kernels
We derive a Fast Multipole Method (FMM) where a low-rank approximation of the kernel is obtained using the Empirical Interpolation Method (EIM). Contrary to classical interpolationbased FMM, where the interpolation points and basis are fixed beforehand, the EIM is a nonlinear approximation method which constructs interpolation points and basis which are adapted to the kernel under consideration...
متن کاملA SVD accelerated kernel-independent fast multipole method and its application to BEM
The kernel-independent fast multipole method (KIFMM) proposed in [1] is of almost linear complexity. In the original KIFMM the time-consuming M2L translations are accelerated by FFT. However, when more equivalent points are used to achieve higher accuracy, the efficiency of the FFT approach tends to be lower because more auxiliary volume grid points have to be added. In this paper, all the tran...
متن کاملA New Adaptive Algorithm for the Fast Multipole Boundary Element Method
A new definition of the interaction list in the fast multipole method (FMM) is introduced in this paper, which can reduce the moment-to-local (M2L) translations by about 30-40% and therefore improve the efficiency for the FMM. In addition, an adaptive tree structure is investigated, which is potentially more efficient than the oct-tree structure for thin and slender domains as in the case of mi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1210.7292 شماره
صفحات -
تاریخ انتشار 2012